Integrating a Verb Lexicon into a Syntactic Treebank Production

نویسنده

  • Milena Slavcheva
چکیده

The creation of linguistically interpreted corpora is a tedious task and the automation of the annotation process is indispensable. A fully automated annotation is hardly possible to achieve, since it requires very sophisticated and large knowledge bases which are, themselves difficult to create. However, a "machine-aided approach" to the annotating process, using as many as available sources of linguistic information is justified and desirable. The creation of an HPSG-based syntactic treebank of Bulgarian [18] is a process of incremental augmentation of the real-world sentences with linguistic annotation as the result of linguistic analysis at different processing levels. At this stage of the treebank production, there are the following main sources of linguistic knowledge that can be integrated into a grammar for automated chunk connection and assignment of grammatical relations:

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Verb Valency Descriptors for a Syntactic Treebank

An essential component of Language Engineering (LE) tools are verb class descriptors that provide information about the relations of the predicates to their arguments. The production of computationally tractable language resources necessitates the assignment of types of predicate-argument relations to a great variety of verb-centered structures: it is necessary to define not only the initial, c...

متن کامل

A Syntactic Valency Lexicon for Persian Verbs: The First Steps towards Persian Dependency Treebank

Valency lexicons are valuable resources for natural language processing. The need for new resources for languages encourages researchers to collect new datasets. One of the most important datasets is valency lexicons. In valency lexicons, information about obligatory and optional complements of words is annotated at the syntactic and semantic levels. In this paper, we report the development of ...

متن کامل

A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian

The paper presents a treebank-driven approach to the construction of a Bulgarian valence lexicon with ontological restrictions over the inner participants of the event. First, the underlying ideas behind the Bulgarian Ontology-based lexicon are outlined. Then, the extraction and manipulation of the valence frames is discussed with respect to the BulTreeBank annotation scheme and DOLCE ontology....

متن کامل

VerbaLex – New Comprehensive Lexicon of Verb Valencies for Czech

The paper presents new lexicon of verb valencies for the Czech language named VerbaLex. VerbaLex is based on three valuable language resources for Czech, three independent electronic dictionaries of verb valency frames. The first resource, Czech WordNet valency frames dictionary, was created during the Balkanet project and contains semantic roles and links to the Czech WordNet semantic network....

متن کامل

Syntactic-Semantic Classes of Context-Sensitive Synonyms Based on a Bilingual Corpus

This paper summarizes first findings of a three-year study (an ongoing research project) on verb synonymy based on both syntactic and semantic criteria. Primary language resources used for the study are existing lexical and corpus resources, namely the Prague Dependency Treebank-style valency lexicons, FrameNet, VerbNet, PropBank and Czech and English WordNets and the parallel Prague Czech-Engl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003